多模式学习通过在预测过程中同样组合多个输入数据模式来重点关注培训模型。但是,这种相等的组合可能不利于预测准确性,因为不同的方式通常伴随着不同水平的不确定性。通过几种方法研究了使用这种不确定性来组合模式,但是成功有限,因为这些方法旨在处理特定的分类或细分问题,并且不能轻易地转化为其他任务,或者遭受数值的不稳定性。在本文中,我们提出了一种新的不确定性多模式学习者,该学习者通过通过跨模式随机网络预测(CRNP)测量特征密度来估计不确定性。 CRNP旨在几乎不需要适应来在不同的预测任务之间转换,同时进行稳定的培训过程。从技术角度来看,CRNP是探索随机网络预测以估算不确定性并结合多模式数据的第一种方法。对两个3D多模式医学图像分割任务和三个2D多模式计算机视觉分类任务的实验显示了CRNP的有效性,适应性和鲁棒性。此外,我们提供了有关不同融合功能和可视化的广泛讨论,以验证提出的模型。
translated by 谷歌翻译
We present a Machine Learning (ML) study case to illustrate the challenges of clinical translation for a real-time AI-empowered echocardiography system with data of ICU patients in LMICs. Such ML case study includes data preparation, curation and labelling from 2D Ultrasound videos of 31 ICU patients in LMICs and model selection, validation and deployment of three thinner neural networks to classify apical four-chamber view. Results of the ML heuristics showed the promising implementation, validation and application of thinner networks to classify 4CV with limited datasets. We conclude this work mentioning the need for (a) datasets to improve diversity of demographics, diseases, and (b) the need of further investigations of thinner models to be run and implemented in low-cost hardware to be clinically translated in the ICU in LMICs. The code and other resources to reproduce this work are available at https://github.com/vital-ultrasound/ai-assisted-echocardiography-for-low-resource-countries.
translated by 谷歌翻译
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
translated by 谷歌翻译
Like fingerprints, cortical folding patterns are unique to each brain even though they follow a general species-specific organization. Some folding patterns have been linked with neurodevelopmental disorders. However, due to the high inter-individual variability, the identification of rare folding patterns that could become biomarkers remains a very complex task. This paper proposes a novel unsupervised deep learning approach to identify rare folding patterns and assess the degree of deviations that can be detected. To this end, we preprocess the brain MR images to focus the learning on the folding morphology and train a beta-VAE to model the inter-individual variability of the folding. We compare the detection power of the latent space and of the reconstruction errors, using synthetic benchmarks and one actual rare configuration related to the central sulcus. Finally, we assess the generalization of our method on a developmental anomaly located in another region. Our results suggest that this method enables encoding relevant folding characteristics that can be enlightened and better interpreted based on the generative power of the beta-VAE. The latent space and the reconstruction errors bring complementary information and enable the identification of rare patterns of different nature. This method generalizes well to a different region on another dataset. Code is available at https://github.com/neurospin-projects/2022_lguillon_rare_folding_detection.
translated by 谷歌翻译
本文描述了(r)ules(o)f(t)he(r)oad(a)dvisor,该代理提供了推荐的和可能从一组人级规则生成的动作。我们以形式和示例描述了Rotra的架构和设计。具体来说,我们使用Rotra正式化和实施英国“道路规则”,并描述如何将其纳入自动驾驶汽车中,从而可以内部推荐遵守道路规则。此外,根据《英国公路法典》(《道路规则》),规定规则是否必须采取行动,或者仅建议采取行动,以指示生成的可能的措施。利用该系统的好处包括能够适应不同司法管辖区的不同法规;允许从规则到行为的清晰可追溯性,并提供外部自动责任机制,可以检查在某些给定情况下是否遵守规则。通过具体的示例,对自动驾驶汽车的模拟显示如何通过将自动驾驶汽车放置在许多情况下,这些场景测试了汽车遵守道路规则的能力。合并该系统的自动驾驶汽车能够确保他们遵守道路和外部(法律或监管机构的规则透明工作,从而使汽车公司,司法管辖区和公众之间的信任更大。
translated by 谷歌翻译
在复杂的动态环境中,有效的轨迹产生在无人体表面车辆(USV)域中仍然是一个开放的问题。在本文中,提出了针对USV-UAV系统的合作轨迹计划算法,以确保USV可以在多障碍物图中的自主进步过程中执行安全,平稳的路径。具体而言,无人机(UAV)扮演飞行传感器的角色,并提供了实时的全球地图和障碍信息,并具有轻巧的语义细分网络和3D投影转换。然后通过基于图的搜索方法生成初始的避免轨迹。关于USV的独特运动不足的运动学特性,引入了基于船体动态约束的数值优化方法,以使该轨迹易于跟踪进行运动控制。最后,提出了基于在执行过程中具有最低能量消耗限制的NMPC的运动控制方法。实验结果验证了整个系统的有效性,并且生成的轨迹在局部对USV始终具有相当大的跟踪精度。
translated by 谷歌翻译
本文比较了软件定义网络中的网络安全性的两种深入强化学习方法。对深Q网络的神经情节控制已实施,并将其与双重深Q网络进行了比较。这两种算法以类似于零和游戏的格式实现。对两个游戏结果进行了两尾t检验分析,其中包含为防守者赢得的冠军的数量。另一个比较是在各自游戏中代理商的游戏得分上进行的。进行分析是为了确定哪种算法是游戏表演者最好的算法,以及它们之间是否存在显着差异,证明一个算法是否会更偏爱另一个。发现两种方法之间没有显着统计差异。
translated by 谷歌翻译
从机器学习的角度来看,当前的语音识别体系结构的表现非常出色,因此用户互动。这表明他们很好地模拟了人类生物系统。我们调查是否可以颠倒推论以提供对该生物系统的见解。特别是听力机制。使用SINCNET,我们确认端到端系统确实学习了众所周知的滤纸结构。但是,我们还表明,在学习结构中,更宽的带宽过滤器很重要。虽然可以通过初始化狭窄和宽带过滤器来获得一些好处,但生理上的限制表明,这种过滤器是在中脑而不是耳蜗中出现的。我们表明,必须修改标准的机器学习体系结构,以允许神经模拟此过程。
translated by 谷歌翻译
从自主驾驶中的对象检测到细胞生物学中细胞形态的分析,需要在广泛的应用中提取区域提取。存在两种主要方法:凸船体提取,对于这些方法存在,并且存在精确有效的算法和凹形船体,它们更擅长捕获现实世界的形状,但没有单个解决方案。尤其是在均匀网格的背景下,凹面船体算法在很大程度上是近似的,牺牲区域的完整性,以实现空间和时间效率。在这项研究中,我们提出了一种新颖的算法,可以提供最大的顶点凹面壳,以最大的(即像素完美)分辨率,并且对于速度效率折衷方案而言是可调的。我们的方法在多个下游应用程序中提供了优势,包括数据压缩,检索,可视化和分析。为了证明我们方法的实际实用性,我们专注于图像压缩。我们通过对单个图像内的不同区域的上下文依赖性压缩(熵编码嘈杂和预测性编码的结构化区域编码)证明了显着改进。我们表明,这些改进范围从生物医学图像到自然图像。除了图像压缩之外,我们的算法还可以更广泛地应用于为数据检索,可视化和分析的广泛实用应用。
translated by 谷歌翻译
残疾人在医疗保健,就业和政府政策等各个领域的各种复杂的决策过程中受到各种复杂的决策。这些环境通常已经不透明他们影响的人并缺乏充分的残疾观点代表,它迅速采用人工智能(AI)技术来用于数据分析以告知决策,从而增加因不当或不公平的算法而造成的伤害风险增加。本文介绍了一个通过残疾镜头进行严格检查AI数据分析技术的框架,并研究了AI技术设计师选择的残疾定义如何影响其对残疾分析对象的影响。我们考虑了三种残疾的概念模型:医学模型,社会模型和关系模型;并展示在每个模型下设计的AI技术如何差异很大,以至于与彼此不相容和矛盾。通过讨论有关医疗保健和政府残疾福利中AI分析的常见用例,我们说明了技术设计过程中的特定考虑因素和决策点,这些因素和决策点影响了这些环境中的电力动态和包容性,并有助于确定其对边缘化或支持的方向。我们提出的框架可以作为对AI技术的深入批判性检查的基础,并开发用于残疾相关的AI分析的设计实践。
translated by 谷歌翻译